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Abstract. - The statistics of records for a time series generated by a continuous time random 
walk is studied, and found to be independent of the details of the jump length distribution, as 
long as the latter is continuous and symmetric. However, the statistics depend crucially on the 
nature of the waiting time distribution. The probability of finding M records within a given time 
duration t, for large t, has a scaling form, and the exact scaling function is obtained in terms of 
the one-sided Levy stable distribution. The mean of the ages of the records, defined as (t/M), 
differs from t/{M). The asymptotic behaviour of the shortest and the longest ages of the records 
are also studied. 
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People's interest in records is evident from the popular- 
ity of the Guinness World Records — which itself holds 
a record as the best selling copyright book. An observa- 
tion is called a (upper) record if its value exceeds that 
of all previous observations. Applications of records are 
found in diverse fields such as meteorology |l! , hydrol- 
ogy [2], economics j3j, and sports [I]. Records also play 
an important role in highlighting climate data to increase 
public awareness of climate change and global warming — 
e.g., a recent analysis from the United States National Cli- 
matic Data Center reveals that the global combined land 
and ocean average surface temperatures for June, April to 
June average, and January to June average, of 2010, are 
the warmest on record [5j. 

Frequently asked questions in the study of records in- 
clude: (a) How many records occur in a given dura- 
tion? (b) How long does a record stay before it is bro- 
ken by a new one? In the case of a discrete time series 
{xq,xi,X2, ■ ■ ■ ,xn} consisting of independent and iden- 
tically distributed (IID) random variables, the statistics 
regarding the above questions are well-studied [6j|9] - 
which have been useful in understanding and analysing 
evolutions in complex systems, ranging from phase slip 
in charge density waves, ageing in glassy systems, to bi- 
ological macroevolution 
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and adaptation 
record statistics for independent but non-identically dis- 
tributed entries have been studied recently in the context 



of biological evolution and warming climate 13 ■ 15 



dom variables — while a time series often contains cor- 
related entries. It is only recently that Majumdar and 
Ziff 16 1 initiated the study of record statistics for corre- 
lated entries — by considering a time series generated by 
a "discrete time random walk" (DTRW): Xi — Xi-i + £j 
with IID jump lengths A subsequent study of record 
statistics of a random walk with Cauchy distributed jumps 
and a drift has appeared in the context of a driven par- 
ticle in a random landscape 17 . These studies, however, 



assume that the samples are collected at regular intervals, 
say To, so that the number of entries N, in a given time 
duration t, is fixed: N = t/ro. This is not the case in many 
real situations where the events happen at irregular inter- 
vals. Consequently for a fixed time duration t, the number 
of entries in the time series {x(to), x(ti), xfa), ■ ■ ■ , x(t]y)} 
is random and, therefore, the results for fixed N would 
not hold. Thus, the natural first step would be to study 
the statistics of records in the simplest possible model of 
correlated time scries in continuous time. In this Letter 
we carry out this important step. 

The well-known model that takes into account of the 
random waiting times r, = tj — between successive 
steps of a random walk, is the so-called "continuous time 
random walk" (CTRW), that was introduced by Montroll 
and Weiss [18], and ever since has been successfully used 
to describe anomalous diffusion in various complex sys- 

In this 



19-23 , including finance and economics 24 



In spite of the continued interest in records, the theory 
has been mostly restricted to the case of independent ran- 



tems 

Letter, we obtain exact asymptotic results for the statis- 
tics of the number and the ages of records, of a time se- 
ries generated by the CTRW with the IID waiting times 
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drawn from a probability density function (PDF) p(r), 
and IID jump lengths drawn from a continuous and sym- 
metric PDF (j>(£). The results are independent of <j>(£) 
- and the reason for this universality is the same as in 
Ref. 
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, namely, the use of the Sparre- Andersen theo- 
26 . However, with respect to the nature of p(r), 



rem 

the statistics of records within a given time duration t, 
display rather rich scaling behaviour for large t. They are 
essentially determined by the behaviour of the Laplace 
transform p(s) = / °° e~ ST p{r) dr = 1 — (t s) q + ■ • • as 
s — > 0, where a = 1 as long as the mean waiting time is 
finite, whereas the latter becomes infinite for < a < 1. 
The asymptotic results of Ref. 



16 correspond to the spe- 



cial case a = 1 of the more general model considered in 
this Letter. 

We first summarise our main results. We find that, the 
probability P(M, t) of finding M records within a given 
time duration t, for large M and t with the scaled variable 
M(£/to) -q / 2 fixed, has the scaling form 



P(M,t) ~ (t/T y a/2 g, 



g a (a/(</t ) 



-a/2 



(1) 



The scaling function is given by 

g a (x)^(2/a)x^ 1+2 ^L a/2 (x^ a ), < a < 1, (2) 

where L^(x) is the one-sided (j3 — +1) Levy stable PDF. 
The asymptotic behaviour of the moments of the number 
of records is given by 



<M") 



T{ya/2) \r a 



x/2 



(3) 



The mean of the ages of the records (I) = (t/M), grows as 
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1— a/2 



(4) 



where ^(x) = r'(x)/r(x) is the digamma function. On 
the other hand, t/(M) ~ r r(l + a/2)(t/r ) 1 - Q / 2 . As for 
the extreme ages of the records: the mean shortest age 
grows as (Z m i n ) ~ To[r(l — a/2)] (t/ro) 1 ""/ 2 whereas 
the mean longest age has a linear growth (Z max ) ~ C a t 
with the growth constant given by 



dx 



1 



iT a/ V 



(5) 



Now we proceed to obtain the joint probability distri- 
bution of the ages and the number of records in a given 
duration t — from which the statistics of individual vari- 
ables can be computed by integrating out the rest. In this 
context it is useful to refer to Fig. [T] Consider a time se- 
ries {x(0), x(t±), xfe), ■ ■ ■ ,x(tff)} generated by a CTRW 
with the IID jump sizes £j = x(t{) — x(ti-i) drawn from a 
continuous and symmetric PDF </>(£) , and the IID waiting 
times Ti = ti—ti—i between successive jumps drawn from a 
one-sided PDF p(r). We have set the initial time to = 0, 
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Fig. 1: (colour on-line). A realisation of a CTRW in the 
time interval [0, t]. Filled circles (in red) show the positions of 
the walker immediately after the jump. The horizontal lines 
between successive steps show the waiting times, whereas the 
vertical lines show the step sizes. Ri and Ti denote respectively 
the value and the time of occurrence of the i-th record. U 
denotes the age of the i-th record. The number of records 
M = 4 for this particular realisation. 



without loss of generality. Clearly, the total number of 
steps AT, taken within a fixed time duration t — such that 
tw < t < ijv+i — is a random variable that varies from 
one realisation to another. Now for any given realisation 
of the walk, an entry x(ti) is a record iff x(U) > x(tj) 
for all j < i. Let R n and T n denote the value and the 
time of occurrence, respectively, of the n-th record. By 
convention, the first entry is taken to be the first record, 
i.e., i?i = x(0), and T\ = 0. Evidently, i?„+i > R n and 
T n+ i > T n for all n > 1. Moreover, during the time in- 
terval between any two successive records, say the n-th 
and the (n + l)-th, the random walk does not exceed the 
earlier record value R n . Therefore, the (n + l)-th record 
is the event in which a random walk starting from the po- 
sition R n at time T n , exceeds its starting position R n at 
time T„_|_i for the first time. It is evident that the PDF 
of the above first-passage process — let it be denoted by 
/(/) — depends neither on the actual value R n nor on 
the individual times T n and T n+ \, but solely on the time 
difference / = T n +i — T n . Let M denote the total num- 
ber of records in the interval [0, t]. Then, after the time 
Tjvf, the random walk does not exceed the value Rm up 
to t, since there are no records in the interval (Tjv/,i]. 
The probability of not exceeding the starting position Rm 
solely depends on the time duration I = t — Tm- Let 
q(l) denote this survival probability. Let l n be the age of 
the n-th record, which is defined as: l n = T n+ i — T n for 
1 < n < M, and Im = t — Tm- Now, due to the Markov 
nature of the random walk, different inter-record intervals 
are uncorrelated. Thus, the joint distribution of the ages 
{li} = {h,l2, ■ ■ ■ ,1m} and the number M of records can 
be written as 



P({h},M,t) = 



M-l 



n /ft) 




(6) 



where the 5-function ensures that the ages add up to the 
total time duration t. 
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The first-passage properties of the CTRW are known 
to be related to that of the DTRW [18]. Let 
/„ be the probability that a random walk exceeds 
its starting position for the first time at the n-th 
step. Then f(l) = Y^=\fnPn{l), where p n (l) = 
/ °° dn ■ ■ ■ / °° dr n [Uti P(r0] 5(1 - Eti n), is the PDF 
for the occurrence of the n-th step at time I. It is 
convenient to take a Laplace transform, which yields 
70) = J™e~ sl f(l)dl = E«=l/nP(*)] n . Similarly q(l) 
can be related to the probability q n that the random 
walk does not exceed its starting position up to the step 
n. Unlike the first-passage event, in this case the n-th 
step could occur at any time r in the interval (0, 1), and 
then the walker does not move for the remaining dura- 
tion r n+ i = I — t (cf. Fig. [I]). The probability that the 
walker remains fixed at least for a duration of r n+1 is 
equal to J T i p(r) dr — which has the Laplace trans- 
form s _1 [l — p(s)}. Therefore, the Laplace transform of 
q(l) reads: q(s) = s _1 [l - p(s)] J2™=a In [p(s)] n . Now, as 
long as the PDF </>(£) of the jump lengths is continuous 
and symmetric, according to the Sparre-Andersen theo- 
rem 25 26 , both /„ and q n are independent of </>(£) - 
the respective generating functions read: J^Li fnZ n = 
v 7 ! - z and J2n°=o = V - z - 



1 - v 7 ! - z and J21 
Laplace transforms 



These yield the 



f(s) = 1-^/1- p(s) and q(s) = s- 1 [l-f(s)\. (7) 

It is now evident that the joint distribution given by 
Eq. ([6]), depends only on the waiting time distribution 
p(r), and not the jump distribution </>(£). 

Let us now compute the probability distribution of the 
number of records by integrating out the ages in Eq. Q, 
i.e., P(M,t) = f™dlyf™dl M P({k},M,t). In this 
regard, it is convenient take a Laplace transform with re- 
spect to t, that disentangles the global constraint imposed 
by the 5-function. This yields 



P(M,t) e~ st dt = 



l-f(s) 



M-l 



(8) 



for M = 1, 2, . . . , oo. The normalisation J2m t) = 1 

can be readily verified by summing the above equation 
over M. 

The behaviour of P(M, t) for large t, can be extracted 
from the small s behaviour of Eq. Q — which, according 
to Eq. Q , points to the small s behaviour of the Laplace 
transform p(s). Now, as long as the mean waiting time 
is finite — i.e., p(r) decays faster than the power-law 
tail t -2 at large r, — the Laplace transform behaves as 
p(s) = 1 — tq.s + ■ ■ ■ as s — > 0. On the other hand, if wait- 
ing time distribution has a slower decay, p(r) ~ T ~^ 1+a ^ at 
large r, with < a < 1, the mean waiting time is infinite, 
and the Laplace transform p(s) — 1— (TQs) a + - ■ ■ as s — >• 0. 
Considering both the above cases together, from Eq. (JT]) 
we get, f(s) ps 1 — (t s) q / 2 as s — > 0, where < a < 1. 
Thus, taking the limit of small s and large M with keeping 



Ms -' 2 fixed in Eq. (|J, results in f™P{M,t) e - st dt ps 

ro(r s) Q / 2_1 e _M ( To;i ) a ". This equation suggests that in 
the scaling limit M — > oo, t — > oo while the scaled vari- 
able M(t/T )- a ' 2 is kept fixed, P(M,t) should have the 
scaling form given by Eq. ([T]), so that 



-sy 



y- a/2 gJy~ a/2 ) 



dy 



e a/2-l p - s ° 



(9) 



To evaluate the scaling function g a [x), it is useful to note 
that e~ sf * is the Laplace transform of the one-sided Levy 
stable PDF L^(y) [19]. From this, it immediately follows 
by differentiation that J °° e~ sy [y L fl (y)] dy — p,s fJ -~ 1 e~ s " . 
Comparing this equation with Eq. the scaling func- 
tion can be expressed in terms of the one-sided Levy sta- 
ble PDF as in Eq. The normalisation of the PDF 
of the scaled number of records x = M(t/ro)~ a / 2 can be 
checked by integrating Eq. |2]) with the change of variable 
y = x~ 2 l a and subsequently noting that L fJi (y) is nor- 
malised to unity, i.e., J Q g a (x) dx = J Q L a / 2 (y)dy = 1. 
Incidentally, the PDF of the maximum displacement of a 
CTRW in a time interval t, also has a scaling form sim- 
ilar to Eq. ([lj, with the scaling function being identical 
to Eq. ([2]) provided that the jumps are distributed accord- 



ing to a narrow distribution having a finite variance 27 
However, it is important to realise that the record statis- 
tics of the CTRW is completely independent of the jump 
distribution as long as the PDF of the jump lengths 
is continuous and symmetric — which also includes Levy 
flights where ~ M is power-law distributed for 

large |£| with < p < 2 and thus has a divergent second 
moment. 

In general, L^(y) in Eq. Q does not have a closed- form 
expression. However, expressing the right hand side of 
Eq. as a series in s and then evaluating the inverse 
Laplace transform of the series, term by term, one gets 



7T ' * 



(-x) 



k - 1 1 a 

y-r(k- 1 sni 



(10) 



This series representation is particularly useful for the nu- 
merical evaluation of g a (x). Using the behaviour of L^(y) 
for small y 19 , we find that for large x, 



(far) 



V(2-a)7r 



exp 



I -Mr 



(ll) 



Figure [5] compares the analytical g a (x) with the numeri- 
cally obtained densities of the scaled variable M(t/To)~ a f 2 
for a = 2/3 and a = 1. In fact, for these particu- 
lar values of a, one has the explicit forms: 52/3^) = 
{y/x~/ir)K 1/3 (2(x/3) 3 / 2 ) where K v (z) is the modified 
Bessel function, and g±(x) = e~ x2 / 4 / y/n respectively. The 
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latter agrees with the scaling function found in Ref. 
for the DTRW, as expected. Numerical simulations of the 
CTRW are performed with both Gaussian and Cauchy 
distributed jumps to show the universality with respect to 
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M(T/0 a/2 

Fig. 2: (colour on-line). Density of the scaled records. For 
a = 2/3, there are three curves that lie on top of each other: 
two of them (in green and blue) are obtained from numerical 
simulations (see text) and the third one (in red) plots the theo- 
retical g 2 /3(x). For a = 1 there are five curves: four (in green, 
blue, magenta and cyan) from the simulation (see text) and 
one (in red) plots gi(x). Inset displays the curves in semi-log 
scale. 




10 10 2 10 3 10 4 10 5 10 6 10 7 10 8 

Fig. 3: (colour on-line). Different averages that are indicated 
on the curves, for a = 2/3. The points (in blue) are obtained 
from numerical simulations by averaging over 10 6 realisations 
of CTRW. The lines (in red) plot the analytical asymptotic 
results reported in the text. Upper inset plots the ratio of 
(t/M) to 1/{M). Lower inset zooms the portion shown. 

ating with respect to s, one gets 



the jump distributions. For a = 2/3, the waiting times are 
drawn from a power-law tail p(r) ~ r~ 5 / 3 . For a = 1, sim- 
ulations are performed with both power-law p(r) ~ r~ 5 / 2 
and exponential p(r) = e~ T distributions of the waiting 
times to show the universality of the results for finite mean 
waiting time. Each simulation is performed with 10 7 re- 
alisations of the CTRW, while taking (t/ro) — 10 7 for 
a = 2/3 and (t/r ) = 10 5 for a = 1. 

To derive the moments (M v ) = *£, M M u P(M,t) for 
large t, we use the scaling form given by Eq. Q, and 
subsequently replace the sum over M by an integral over 
the scaled variable, which is justified for large t. This 
gives (M v ) a A a<v (t/r ) ra / 2 with A a<v = x v g a (x) dx. 
Now substituting g a {x) from Eq. (pi) and then mak- 
ing a change of variable y = x~ 2 <~^ we get A a , v = 

S™y~ va/2 L a/2 {y)dy = (2/a)r(i/)[r(i/a/2)]~\ where in 



the last part we have used the known expression 19 for 
the negative moments of L^(y). 

Let us now turn our attention to the statistics related to 
the ages of the records. The mean of the ages of the records 
for a given time interval t, is (I) = (A/ -1 X)i=i h) = 
(t/M). To compute this average, it is convenient to con- 
sider the moments generating function of M. Multiply- 
ing both sides of Eq. (Jsj) by z M_1 and summing over M 
gives J °° dt e~ st £~ =1 z M ~ l P{M, t)= s - l [l- J{s)] [l- 
zf(s)] . Now integrating over z in [0, 1], and different i- 



dt e 



\M/ 




l-f(s) • (12) 



The asymptotic behaviour of the mean of the ages, for 
large t, emerges from the singularities near s = in 
the abov e equation. The inverse Laplace transform of 



Eq. |12], with [1 - /(«)] -> (r s) Q / 2 and f{s) -> 1, 
yields the result displayed by Eq. Q. It is interest- 
ing to note that (t/M) given by Eq. Q differs from 
t/(M) - T r(l + a/2)(t/ Tn ) 1 - a / 2 , where in the latter 
we have used first moment from Eq. ([3]). This non-self- 
averaging behaviour can be traced back to the broad dis- 
tribution /(f) - H 1+Q / 2 ) of the inter -record intervals. It 
should be emphasised that the mean of the ages of the 
records in a given time interval t, is to be distinguished 
from the mean inter-record interval — in fact, the latter 
is infinite. The distinction between the two means arises 
from the definition of the age of the last record Im in a 
given time interval t (see Fig.[T]). 

The statistics of the longest age of the records is similar 
to that of the longest excursion in a renewal process that 
was studied recently [28]. Therefore, skipping details, 
we cast the result in our context. The mean longest age 
of records in a given time duration t, for large t, reads 
(imax) ~ Ca t, where C a is given by Eq. 

The mean shortest age of the records for a given t 
is, evidently, (l m - m ) — j Q G(u, t) du, — where G(u, t) = 
Probffmin > u] is the cumulative distribution of the short- 
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est age. The Laplace transform of (l m i n ) is given by 



dt 



du — 
o 1 



di 



Cf(l)e-°idV 



(13) 



where the integrand — that expresses / °° e~ st G(u, t) dt 
- is obtained by taking a Laplace transform of Eq. ^ 
with respect to t, integrating over {Z^} in (it, oo) and sum- 
ming over M. The asymptotic behaviour of (l m in) is ex- 
tracted by analysing the leading singularity near s = 
in the above equation — that yields (lmin) ~ to [r(l — 
a/i^it/ro) 1 - 01 / 2 for large *. 

Figure[3]compares the analytical asymptotic expressions 
of the averages mentioned above, with the respective val- 
ues obtained from a numerical simulation of a CTRW with 
a = 2/3. 

In conclusion, we have shown that the statistics of 
records for a correlated time series generated by a CTRW 
display rather rich behaviour. We expect the results to be 
relevant to a wide class of problems. For example, CTRW 
has been recently used to model various time series that 
occurs in finance and economics 24 , and hence our results 



would be useful in analysing those financial time series. 
For various contexts (e.g., in regard to the issues of "cli- 
mate change with a warming trend" ), it may be interesting 
to extend the study to non-symmetric </)(£) — using the 
generalised Sparre- Andersen theorem (see Ref. [29]). It is 
interesting to mention that for the particular case of the 
Cauchy distributed jumps with a constant drift \i — i.e., 
the jumps are drawn from a non-symmetric Cauchy PDF 
0(£) — K~ 1 a/[a 2 + (£ — m) 2 ] — one obtains similar results, 
namely, a/2 in the results of this paper is replaced by 
a[l/2 + 7r _1 tan _1 (///a)] in that caseQ Finally, it is useful 
to compare the statistics of the number of records of the 
CTRW with that of a time series of IID random entries 
drawn from p(x) in continuous time with the waiting time 
between successive entries drawn from p(r). In the latter 
case, it turns out that the distribution of the number of 
records P(M, t) does not depend on p(x) and for large t, it 
approaches a Gaussian around its mean (M) ~ aln(t/ro) 
with a variance (M 2 ) — (M) 2 ~ aln(t/ro) — therefore, 
the qualitative features do not change drastically as one 
changes a. In contrast, for the CTRW we have found that 
the qualitative behaviour depends crucially on the param- 
eter a. 
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